Extracting Assumptions from Incomplete Data

نویسندگان

  • Honglei Zeng
  • Richard Fikes
چکیده

Information integration is the task of aggregating data from multiple heterogeneous data sources. The understandings of semantics and context knowledge of data sources are often the keys to challenging problems in information integration such as schema alignments and inconsistency resolution. Context logic provides a unified framework for the modeling of data sources; nevertheless, the acquisition of large amounts of context knowledge is difficult and possibly infeasible. In this paper, we study the importance of a special type of context knowledge, namely assumption knowledge, in information integration. Assumption knowledge refers to a set of implicit rules about assumptions and biases on which a data source is based. We develop a decision tree classifier to extract assumption knowledge from incomplete data and formalize the knowledge in context logic. Finally, we build an information aggregator with assumption knowledge reasoning, which is capable of explaining incomplete data aggregated from heterogeneous sources.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting Assumptions from Missing Data

Information integration is the task of aggregating data from multiple heterogeneous data sources. The understandings of context knowledge of data sources are often the keys to challenging problems in information integration such as handling missing and inconsistent data. Context logic provides a unified framework for the modeling of data sources; nevertheless, the acquisition of large amounts o...

متن کامل

پیش فرض ها و ارزش های فرهنگ سازمانی اسلامی: پژوهشی در چارچوب مدل فرهنگ سازمانی شاین

Edgar Schein has introduced a model in organizational culture. He claims that the origin of every culture is its assumptions which are bases for that culture’s values. Culture symbols, then, will be unfolded based on those values which have derived from culture’s assumptions. In this paper, in order to approach “Complete Organization” type, based on Schein’s model, the assumptions and values of...

متن کامل

Extracting reliable data from the fetal MCG

Extracting reliable information from a fetal MCG measured before the 24 week of gestation is hampered due to the poor signal-to-noise ratio. Thence, the recorded signals need to be processed in order to separate noise and signal, and need to be displayed in such a way that a reliable diagnosis can be made. No signal processing can be performed without making assumptions on either noise or signa...

متن کامل

Extracting regular mobility patterns from sparse CDR data without a priori assumptions

In this work we present two methods that can extract habitual movement patterns and reconstruct the underlying movement of users from their call detail records (CDR) in a way that works for users with only moderate numbers of CDRs and that does not make any prior assumptions on the behaviour of the users. The methods allow for a more comprehensive user base in large-scale studies due to the fac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005